Comparing data samples represented by characteristic functions

ABSTRACT

According to certain embodiments, a first characteristic function representing a first set of samples and a second characteristic function representing a second set of samples are generated. The first characteristic function and the second characteristic function are transformed to a first arithmetic function and a second arithmetic function, respectively. A first hash code and a second hash code are calculated from the first arithmetic function and the second arithmetic function, respectively. If the first hash code equals the second hash code, the first set of samples and the second set of samples are designated as equivalent; otherwise, the first set of samples and the second set of samples are designated as not equivalent.

TECHNICAL FIELD

This invention relates generally to the field of data systems and morespecifically to comparing data samples represented by characteristicfunctions.

BACKGROUND

Sensors may be used in different situations (such as medical,environment, and other situations) to take measurements over time. Incertain cases, the measurements may yield a relatively large volume ofdata, which may be difficult to analyze. Techniques may be used toprocess (such as store, utilize, and/or analyze) large volumes of data.

SUMMARY OF THE DISCLOSURE

In accordance with the present invention, disadvantages and problemsassociated with previous techniques for processing data may be reducedor eliminated.

According to certain embodiments, a first characteristic functionrepresenting a first set of samples and a second characteristic functionrepresenting a second set of samples are generated. The firstcharacteristic function and the second characteristic function aretransformed to a first arithmetic function and a second arithmeticfunction, respectively. A first hash code and a second hash code arecalculated from the first arithmetic function and the second arithmeticfunction, respectively. If the first hash code equals the second hashcode, the first set of samples and the second set of samples aredesignated as equivalent; otherwise, the first set of samples and thesecond set of samples are designated as not equivalent.

Certain embodiments of the invention may provide one or more technicaladvantages. A technical advantage of one embodiment may be that sensordata may be represented by a characteristic function that can be storedas a binary decision diagram. Another technical advantage of oneembodiment may be that a search query may be represented by a queryfunction. The search query and the characteristic function may be usedto obtain sensor values of the sensor data that satisfy the searchquery.

Another technical advantage of one embodiment may be that model sensordata for a particular annotation may be represented by a particularannotated model characteristic function. The annotated modelcharacteristic function may be combined with a characteristic functionto annotate the characteristic function with the annotation. Anothertechnical advantage of one embodiment may be that Boolean functions maybe transformed to arithmetic functions. Hash codes may be calculatedfrom the arithmetic functions. If the hash codes are equal, then theBoolean functions may be designated as equivalent.

Certain embodiments of the invention may include none, some, or all ofthe above technical advantages. One or more other technical advantagesmay be readily apparent to one skilled in the art from the figures,descriptions, and claims included herein.

BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the present invention and itsfeatures and advantages, reference is now made to the followingdescription, taken in conjunction with the accompanying drawings, inwhich:

FIG. 1 illustrates an example of system that may be used to processsensor data;

FIG. 2 illustrates an example of a method for representing sensor databy characteristic functions;

FIG. 3 illustrates an example of a method for querying sensor datarepresented by characteristic functions;

FIG. 4 illustrates an example of a method for annotating characteristicfunctions; and

FIG. 5 illustrates an example of a method for determining whethercharacteristic functions are equivalent.

DETAILED DESCRIPTION OF THE DRAWINGS

Embodiments of the present invention and its advantages are bestunderstood by referring to FIGS. 1 through 5 of the drawings, likenumerals being used for like and corresponding parts of the variousdrawings.

FIG. 1 illustrates an example of system 10 that may be used to processsensor data. In certain embodiments, system 10 may represent sensor databy a characteristic function that can be stored as a binary decisiondiagram. In certain embodiments, system 10 may represent a search queryby a query function. The search query and the characteristic functionmay be used to obtain sensor values of the sensor data that satisfy thesearch query.

In certain embodiments, system 10 may represent model sensor data for aparticular annotation by a particular annotated model characteristicfunction. The annotated model characteristic function may be combinedwith a characteristic function to annotate the characteristic functionwith the annotation. In certain embodiments, system 10 may transformBoolean functions (such as characteristic functions) to arithmeticfunctions. Hash codes may be calculated from the arithmetic functions.If the hash codes are equal, then the Boolean functions may bedesignated as equivalent.

In the illustrated embodiment, system 10 includes an interface (IF) 20,a computing system 22, a memory 24, and a sensor system 28 coupled asshown. Computing system 22 includes one or more processors 29. Logic 26includes a binary decision diagram (BDD) generator 30, a query engine32, a model engine 34, and a signature engine 36, and may be stored bycomputing system 22 and/or memory 24. Memory 24 stores sensor data 40and a BDD library 41. Sensor system 28 includes one or more sensors 50.

In certain embodiments, sensors 50 of sensor system 28 measure features(such as medical or environmental features) to yield measurements (suchas medical or environmental measurements), which are sent to computingsystem 22. A measurement is typically expressed as a numerical value.

Examples of sensors 50 may include medical, environmental, and/or othersuitable sensors. Medical sensors may be used to measure one or morefeatures of a patient's medical state. Medical sensors may includemedical monitors, medical laboratory equipment, therapeutic equipment,medical imaging machines, and/or other medical sensor. Examples ofmedical sensors include electrocardiogram (ECG) sensors, blood pressuresensors, and/or pulse oximetry sensors.

An electrocardiogram sensor records electrical activity of the heartover time through skin electrodes. An electrocardiogram sensor mayoutput tracings of heartbeats. A tracing of a normal heartbeat typicallyincludes a P-wave, a QRX complex (that includes an R-wave), and aT-wave. Samples may record any suitable features of the tracings. Forexample, samples may record intervals between features of consecutiveheartbeats, such as the interval between two consecutive R-waves. Theinterval may be used to extract information about heart rate and itsvariability.

A blood pressure sensor may use a sphygmomanometer to measure bloodpressure. The measurement may include systolic and/or diastolic valuesin units of millimeters of mercury (mmHg). In certain instances, bloodpressure may be measured at regular intervals throughout the day andnight.

A pulse oximetry sensor may be used to measure the oxygenation ofhemoglobin. A pulse oximeter may be placed on the skin (such as afingertip) and transmit a red and an infrared wave. The absorption ofthe waves may be measured to determine oxygen saturation. The pulseoximeter may output the oxygen saturation as a percentage from zeropercent to ninety-nine percent.

Environmental sensors may measure an environmental feature, for example,such as geographic location, air pressure, elevation, and/ortemperature. Examples of environmental sensors include a GlobalPositioning System (GPS) that determines location, a barometer thatmeasures air pressure, an altimeter that measures elevation, and athermometer that measures temperature.

Sensor data 40 may include any suitable information. In certainembodiments, sensor data 40 records measurements taken by one or moresensors 50. Sensor data 40 may include samples that may have anysuitable format. In certain embodiments, the format of the samples maybe a tuple (or ordered set) that has one or more data parameters, and aparticular sample may be a tuple of one or more values for the one ormore data parameters. For example, a tuple format (t, p) may have dataparameters time t and pressure p, and a particular sample (t0, p0) mayhave values pressure p0 measured at time t0.

The tuple format may include any suitable data parameters, such as oneor more sensor parameters and/or one or more test parameters. A sensorparameter may correspond to one or more sensors 50, and a sensor valuemay record one or more measurements taken by one or more sensors 50. Forexample, a sensor value may record a measurement taken by a sensor 50. Atest parameter may correspond to a factor that describes a temporal,spatial, and/or environmental feature of the measurement process, and atest value may record the value of the feature when the measurements aretaken. For example, the parameter may be time, and the parameter valuemay record a particular time at which measurements are taken.

Examples of temporal features include time, which may be expressed as anabsolute time (for example, 2:00 PM, May 25, 2010) or as relative time(for example, time elapsed from a starting time or time remaining untilan ending time). Examples of spatial features include location, such asgeographical location (which may include longitude, latitude, and/oraltitude), location on a body (for example, a human body), and type oflocation (for example, rural or urban). Examples of environmentalfeatures describe physical characteristics of an environment, such astemperature (for example, atmospheric temperature or body temperature).

Model sensor data 40 describes sensor data that may be used to annotatesensor data obtained from measurements in order to categorize the data.For example, certain model sensor data may be categorized and annotatedwith a “normal” (or similar) annotation, while other sensor data may becategorized and annotated with an “abnormal” (or similar) annotation.Sensor data obtained from measurements that match the normal modelsensor data may be categorized as normal, while measured sensor datathat match abnormal model sensor data may be categorized as abnormal.

Any suitable annotation may be used. In certain embodiments, medicalannotations that may be used to categorize medical sensor data. Examplesof medical annotations may include a “normal” annotation for normalsensor data and an “abnormal” annotation for abnormal sensor data. Otherexamples of medical annotations may include annotations for particulardiseases, conditions, symptoms, severity, and/or other category ofmedical sensor data.

In certain embodiments, environmental annotations that may be used tocategorize environmental sensor data. Examples of environmentalannotations may include a “normal” annotation for normal sensor data andan “abnormal” annotation for abnormal sensor data. Other examples ofenvironmental annotations may include annotations for particular weatherconditions, geographical features, social conditions, and/or othercategory of environmental sensor data.

Model sensor data includes model samples. A model sample comprises atuple of one or more model sensor values. A model sensor valuerepresents one or more measurements that could have been taken by one ormore sensors. The model samples may be annotated with the annotation toindicate the category to which it belongs.

Binary decision diagram library 41 stores binary decision diagrams. Incertain embodiments, a binary decision diagram (BDD) is a rooteddirected acyclic graph (DAG) that may be used to represent a Booleanfunction ƒ. A BDD includes nodes, such as non-terminal (or decision)nodes and terminal nodes, where terminal nodes include root nodes. Anon-terminal node corresponds to a sub-function ƒ and is labeled by aBoolean variable v=x_(i). A non-terminal node has an outgoing 1-edge andan outgoing 0-edge pointing to child nodes. A 1-edge points to thesub-BDD that represents function v·ƒ, and a 0-edge 88 points to thesub-BDD that represents function v·ƒ. In other words, a 1-edgerepresents an assignment of v to 1, and a 0-edge represents anassignment of v to 0. Terminal nodes include a 0-terminal and a1-terminal that represent Boolean functions 0 and 1, respectively.

A path from the root node to the 1-terminal represents a set of variableassignments setting the represented Boolean function to 1. A path fromthe root node to the 0-terminal represents a set of variable assignmentssetting the represented Boolean function to 0.

In certain embodiments, a BDD is stripped of redundant decision nodesand subgraph isomorphisms. In certain embodiments, an ordered binarydecision diagram (OBDD) is a BDD where all paths from the root node tothe terminal nodes examine variables in the same order. OBDDs may allowfor canonical representations. Other examples of BDDs include reducedordered BDDs (ROBDDs), partitioned ordered binary decision diagrams(POBDDs), zero-suppressed decision diagrams (ZDDs), nano binary decisiondiagrams (nanoDDs), zero-suppressed nano binary decision diagrams(nanoZDDs), other suitable binary decision diagrams, and/or acombination of any of the preceding. In a ROBDD, isomorphic subgraphsare not present, and the order of the variables from the root node ofthe BDD to a terminal node are the same for all paths. In a ZDD, a nodeis excluded if the node is a negative literal. In other RBDDs, a node isexcluded if both edges of the node point to the same node. Examples ofthe other types of BDDs are described in more detail below.

In certain embodiments, node structure of BDD library 41 includes anysuitable information, such as information about each binary variable andindices to the nodes that correspond to the two possible evaluations ofthe variable. BDD library 41 may also include information aboutcomplementation of one of the indices.

In certain embodiments, BDD library 41 may store the informationcompactly. In certain embodiments, BDD library 41 may maintain theindices and variable identifiers as a function of the size of the BDD.For example, a BDD may have at most k nodes throughout some or allmanipulations performed by BDD library 41. Each vertex of the BDD may belabeled with one of at most v variable identifiers.

The indices to nodes therefore require at most ┌ log(v)┐ bits to indexany variable. The node therefore requires only 2·┌ log(k)┐+┌ log(v)┐bits. In addition, two bits may be reserved, one bit used to identifycomplemented edges and another bit used as a general mark bit usedduring garbage collection. Values for v and k may be determined in anysuitable manner. As an example, a user may specify v and a default kvalue may be used initially. When the address space allowed by thedefault k value is exhausted, the k value may be increased and the nodetable may be rebuilt. As another example, maximum values for v and k maybe assumed.

In certain embodiments, BDD generator 30 accesses a set of samples ofsensor data that records measurements taken by one or more sensors. BDDgenerator 30 represents each sample as a minterm to yield a set ofminterms. BDD generator 30 generates a characteristic function from theminterms, the characteristic function indicating whether a given mintermis a member of the set of minterms.

In certain embodiments, a characteristic function ƒ^(S) of a set Sindicates whether a given natural (represented by a minterm) is a memberof a set S. In certain embodiments, characteristic function ƒ^(S)({rightarrow over (x)}) of a set S⊂IN may be the Boolean function such thatƒ^(S)({right arrow over (x)})=1 iff {right arrow over (x)} is the binaryrepresentation of an element of S. For example, for S={1,3},ƒ(0,0)=ƒ(1,0)=0 and ƒ(0,1)=ƒ(1,1)=1.

A minterm is a logical expression of n variables that employs only thecomplement operator and the conjunction operator. For a Boolean functionof n variables, a minterm is a product term in which each of the nvariables appears once, either in a complemented or uncomplemented form.

In certain embodiments, query engine 32 receives a search query for asearch of a set of samples of sensor data. The search query indicatesone or more requested values of one or more parameters. The samples arerepresented by a characteristic function indicating whether a givenbinary representation represents a sample of the set of samples. Queryengine 32 formulates a query function representing the requested values.Query engine 32 uses the query function and the characteristic functionto identify one or more samples that have the one or more requestedvalues.

In certain embodiments, model engine 34 accesses one or more sets ofmodel samples of model sensor data. Each set comprises model samples fora corresponding annotation of one or more annotations. Model engine 34performs the following for each set to yield one or more annotated modelcharacteristic functions: represent each model sample of the each set asa model minterm to yield a set of model minterms; generate a modelcharacteristic function from the model minterms, the characteristicfunction indicating whether a given minterm is a member of the set ofmodel minterms; and annotate the model characteristic function. Modelengine 34 generates a general model characteristic function from theannotated model characteristic functions.

In certain embodiments, signature engine 34 receives a first Booleanfunction and a second Boolean function, such as first and secondcharacteristic functions. Signature engine 34 transforms the first andsecond Boolean functions to yield first and second arithmetic functions,respectively. Signature engine 34 provides the same input to the firstand second arithmetic functions to calculate first and second hash codes(or “signatures”), respectively. If the first hash code equals thesecond hash code, signature engine 34 designates the first and secondBoolean functions as equivalent. Otherwise, signature engine 34designates that the first and second Boolean functions as notequivalent.

In certain embodiments, interface 20 receives input from, for example, auser, using any suitable input device and communicates the input tocomputing system 22. Interface 20 receives output from computing system22 and communicates the output to computing system 22 using any suitableoutput device.

FIG. 2 illustrates an example of a method for representing sensor databy characteristic functions. The method may be performed by BDDgenerator 30. In the method, sensor data 40 is accessed at step 110.Sensor data 40 records measurements taken by sensors 50. For example,sensor data 40 may record measurements taken by a first sensor 50 with afrequency of 1 Hertz and a second sensor 50 with a frequency of 60Hertz.

A set S of samples is generated from sensor data 40 at step 114. Incertain embodiments, each sample comprises a tuple of one or more sensorvalues. Each sensor value records one or more measurements taken by oneor more sensors at a test value of a test parameter. The test parametermay represent time, temperature, or location. The sample tuple may alsoinclude the test value.

Set S may be generated in any suitable manner. In certain embodiments,time may be quantized according to the sampling frequencies of sensors50 and/or desired accuracy. For each time t_(i), set S of sensor valuesis obtained to yield S={(t_(i), q_(i) ¹, . . . , q_(i) ^(k))}, where qq_(i) ^(j) is the quantized input from sensor j at time instance i.

Each sample is represented as a minterm at step 118. The sample may berepresented as a minterm in any suitable manner. In certain embodiments,one or more variables are allocated to each data value (for example, atest or sensor value) of a sample. For example, Nt (for example, Nt=32)variables may be allocated for time, Ns1 (for example, Ns1=8) variablesfor the first sensor, and Ns2 (for example, Ns2=8) variables for thesecond sensor. In the example, the sample corresponds to a minterm ofthe form t₁ . . . t_(Nt)·s₁ ¹ . . . s_(Ns1) ¹ . . . s₁ ² . . . s_(Ns2)², for example, t₁t₂ . . . t₃₂·s₁ ¹ . . . s₈ ¹ . . . s₁ ² . . . s₈ ².

Each sensor value is expressed as a binary number using the allocatedvariables. In the example, a subset of S may be {(1,70,3), (2,70,3),(3,70,4)}. The related minterms are:00000000000000000000000000000001·01000110·00000011,00000000000000000000000000000010·01000110·00000011,00000000000000000000000000000011·01000110·00000100.

Characteristic function ƒ^(S) is generated from the minterms at step122. Characteristic function ƒ^(S) indicates whether a given minterm isa member of the set of minterms. Characteristic function ƒ^(S) may begenerated from the minterms in any suitable manner. In certainembodiments, a logical operation may be applied to the minterms togenerate characteristic function ƒ^(S). A logical operation may be oneof or a logical combination of any two or more of the following: AND,OR, XOR, and NOT. In certain embodiments, a logical OR operation may beapplied to the minterms to generate characteristic function ƒ^(S).Applying a logical OR operation to a number of operands yields thelogical OR of the operands. The corresponding characteristic functionƒ^(S)({right arrow over (x)}; {right arrow over (s)}¹; {right arrow over(s)}²) is the logical OR of all minterms.

There may be next samples of sensor data 40 at step 126. For example,there may be newer, or more recent, samples in sensor data 40. If thereare next samples, the method returns to step 114 to generate a next setS of the next samples. If there are no next samples, the method proceedsto step 130.

Characteristic function ƒ^(S) is updated using the next samples at step130. Characteristic function ƒ^(S) may be updated in any suitablemanner. In certain embodiments, steps similar to steps 114 through 122may be performed. In the embodiments, a set of next samples may begenerated. Each next sample may be represented as a next minterm.Characteristic function ƒ^(S) may be updated using the next minterms.For example, a logical operation (such as a logical OR operation) may beapplied to characteristic function ƒ^(S) and the next minterms to yieldan updated characteristic function ƒ^(S).

Characteristic function ƒ^(S) is reported at step 134. Characteristicfunction ƒ^(S) may be reported in any suitable manner. For example, BDDgenerator 30 may facilitate display of characteristic function ƒ^(S) atinterface 20.

FIG. 3 illustrates an example of a method for querying sensor datarepresented by characteristic functions. The method may be performed byBDD generator 30. In the method, a search query is received at step 210.The search query requests a search of a set of samples of sensor data 40and may have any suitable format. In certain embodiments, the searchquery may indicate one or more requested values of one or more dataparameters, and may request retrieval of samples that satisfy therequested values. A data parameter may be a sensor parameter thatcorresponds to a sensor and/or a test parameter that describes, forexample, a spatial, temporal, and/or geographical feature. The samplesof sensor data 40 may be represented by a characteristic functionindicating whether a given binary representation represents a sample ofthe set of samples.

Query function ƒ_(R) representing the requested values is formulated atstep 214. Query function ƒ_(R) may be used to identify samples(represented by a characteristic function) that have the requestedvalues. Query function ƒ_(R) may be formulated in any suitable manner.In certain embodiments, each requested value may be expressed as arequested minterm, and a range query function ƒ_(R) may be formulatedfrom the requested minterms. For example, if the requested values aret=128 through 255, then query function ƒ_(R)({right arrow over(t)};{right arrow over (s)}¹; . . . , {right arrow over (s)}^(N))= t ₃₁t ₃₀ . . . t ₈t₇.

As an another example, the following method may be used to generate aBDD of a Boolean function TH_(value)(x) that yields 1 when the numberthat is represented in the vector of binary variables x is larger thanor equal to value. For example, TH_(value)(x) may be the following:TH ₅(0,0,0)=0TH ₅(0,0,1)=0TH ₅(0,1,0)=0TH ₅(0,1,1)=0TH ₅(1,0,0)=0TH ₅(1,0,1)=1TH ₅(1,1,0)=1TH ₅(1,1,1)=1

According to the method, a BDD of a Boolean function TH_(value) (x) maybe generated by:

BDD threshold(value, bits)

{ result = 1 while(bits>0) { bits = bits − 1 if(value mod 2 = 1) result= result AND var_(bits) else if(result <> 1) result = result ORvar_(bits) value = value / 2 } return result }

The generated BDD may be used to count the number of instances where thenumber that is represented in the vector of binary variables x is largerthan or equal to value. As another example, if samples where sensor jhas values between A and B are requested, the following query functionmay be used: ƒ_(R)({right arrow over (t)}; {right arrow over (s)}¹; . .. {right arrow over (s)}^(N))=TH_({A})({right arrow over (s)}^(j)) ANDTH _({B+1})({right arrow over (s)}^(j)). As another example, if therequested values are all sensor values, the query function may be blank.

The query function and the characteristic function are used at step 218to yield search results. The query function and the characteristicfunction may be used in any suitable manner. In certain embodiments, thequery function and the characteristic function may be logically combinedby applying a logical operation (such as a logical AND operation) to thefunctions. For example, characteristic function ƒ^(S) may be ANDed withquery function ƒ_(R)({right arrow over (t)})= x ₁ x ₂ x ₃ . . . x ₂₄.Applying a logical AND operation to a number of operands may yield thelogical AND of the operands. The search results may be one or moresamples that have the requested values or may be the number of samplesthat have the requested values.

In certain embodiments, the number of samples that have the requestedvalues may be determined. The number may be determined in any suitablemanner. For example, Boolean function TH_(value)(x) may yield 1 when thenumber that is represented in the vector of binary variables x is largerthan or equal to value. The number of instances where a BDD representingTH_(value)(x) yields 1 may be counted to determine the number ofinstances where x is larger than or equal to value.

The search results are reported at step 222. The search results mayreported in any suitable manner. For example, the search results may beexpressed as binary decision diagrams.

FIG. 4 illustrates an example of a method for annotating characteristicfunctions. The method may be performed by model engine 34. In themethod, model sensor data is accessed at step 310. In certainembodiments, one or more sets of model samples of model sensor data 40may be accessed. Each set comprises model samples for a correspondingannotation of one or more annotations.

Annotated model characteristic function ƒ^(ai) is generated for anannotation a_(i) at step 314. Annotated model characteristic functionƒ^(ai) represents model samples annotated with one or more annotations.Annotated model characteristic function ƒ^(ai) may be used to identifymeasured samples that belong to the category indicated by theannotation.

Annotated model characteristic function ƒ^(ai) may be generated in anysuitable manner. In certain embodiments, each model sample of annotationa_(i) may be represented as a model minterm, and model characteristicfunction ƒ^(ai) may be generated from the model minterms by, forexample, applying a logical operation (such as a logical OR operation)to the minterms. Model characteristic function ƒ^(ai) indicates whethera given minterm is a member of the model minterms.

In the embodiments, model characteristic function ƒ^(ai) may beannotated to yield annotated model characteristic function ƒ^(ai). Modelcharacteristic function ƒ^(ai) may be annotated in any suitable manner.In certain embodiments, a Boolean variable is used to representannotation a_(i). A mathematical operation (such as the productoperation) may be applied to the Boolean variable and the modelcharacteristic function yield the annotated model characteristicfunction.

In an example scenario, time has a 32-bit resolution, and sensors havean 8-bit resolution. The k^(th) sensor values [64,127] at time [0,31]may be annotated with the normal attribute as follows:ƒ^(a) ^(normal) ({right arrow over (t)}; s ^({right arrow over (1)}) ; .. . ; s ^({right arrow over (k)}))= t ₃₁ t ₃₀ . . . t ₆ t ₅ s ₇ ^(k) s ₆^(k)

There may be a next annotation a_(i+1) at step 316. If there is a nextannotation a_(i+1), the method returns to step 314 to generate a modelcharacteristic function ƒ^(ai+1) for next annotation a_(i+1). If thereis no next annotation a_(i+1), the method proceeds to step 318.

General model characteristic function g is generated from annotatedmodel characteristic functions ƒ^(a) at step 318. General modelcharacteristic function g may represent some or all sensor data 40.General model characteristic function g may be used to annotate samplesof a given characteristic function, which is described in more detailbelow.

General model characteristic function g may be generated in any suitablemanner. In certain embodiments, a logical operation (such as a logicalOR operation) may be applied to annotated model characteristic functionsƒ^(a) to yield general model characteristic function g:g({right arrow over (a)};{right arrow over (t)}; s^({right arrow over (1)}) ; . . . ; s ^({right arrow over (k)}))=V_(i)ƒ^(a) ^(i)

Characteristic function ƒ^(S) representing samples of sensor data 40 isreceived at step 322. The samples are annotated using the general modelcharacteristic function g at step 326. The samples may be annotated inany suitable manner. In certain embodiments, a mathematical operation(such as a product operation) may be applied to the characteristicfunction and the general model characteristic function to annotate thesamples:ƒ^(Q)=ƒ^(S) ·gAnnotated characteristic function ƒ^(Q) represents samples ofcharacteristic function g annotated with annotations.

Core operations may be performed on annotated characteristic functionƒ^(Q). In certain embodiments, annotated characteristic function ƒ^(Q)may be queried to identify samples that have a particular annotation.For example, a density query can provide information on the percentageof data points annotated as with a particular annotation. As anotherexample, the time range of data points with a particular annotation canbe computed.

The query may be performed in any suitable manner, such as a mannersubstantially similar to that described herein. For example, a queryfunction representing a given annotation may be formulated. The queryfunction and the annotated characteristic function may then be combinedto identify the samples.

The results are reported at step 330. The results may be reported in anysuitable manner. For example, the results may be reported throughinterface 20.

FIG. 5 illustrates an example of a method for determining whetherBoolean functions (such as characteristic functions) are equivalent. Themethod may be performed by signature engine 36.

Signature engine 36 receives a first Boolean function and a secondBoolean function at step 410.

The first Boolean function and the second Boolean function aretransformed to a first arithmetic function and a second arithmeticfunction, respectively, at step 414. The Boolean functions may betransformed in any suitable manner. For example, the transformations maybe performed according to one or more of the following rules:X AND Y→X×YX OR Y→X+Y−X×YNOT(X)→1−XX AND X(idempotence)→X×X=X; X ^(k) =Xwhere → represents “is transformed to”, AND represents logical AND, ORrepresents logical OR, NOT represents logical negation, X AND Xrepresents idempotence, × represents multiplication, + representsaddition, and a superscript represents an exponent. For example, ifBoolean function F=X OR Y, then the arithmetic function A[F]=X+Y−X×Y. Incertain examples, if the finite integer field has a size p, thearithmetic is performed modulo p.

In certain cases, a hash code H may be determined for a logicalcombination of Boolean functions B1 and B2. Hash code H may bedetermined from an arithmetic combination of hash codes H1 and H2, whereH1 is the hash code of Boolean function B1 and H2 is the hash code ofBoolean function B2. For example, the theorem of orthogonality may beapplied. If Boolean functions B1 and B2 do not overlap in time, thenhash code H for B1 V B2=H1+H2=H.

The same input is provided to the first arithmetic function and thesecond arithmetic function to calculate a first hash code and a secondhash code, respectively, at step 418. Any suitable input may be used. Incertain embodiments, the input may be randomly generated integers. Inthe example, if input X=5, Y=7 is provided to arithmetic functionA[F]=X+Y−X×Y, then hash code is A[F]=5+7−5×7=−23.

The first and second hash codes are compared at step 422. The first andsecond hash codes may be equal or not equal at step 426. Hash codes ofequivalent functions are the same, and hash codes of different functionsare probably different. If arithmetic expressions are evaluated in afinite integer field under randomization, then any distinct pair of 2²^(n) Boolean functions almost always maps to distinct hash codes. Theprobability of error is n/(size-of-integer-field), where integer fieldZ_(p)={0, 1, . . . , p−1} and p is prime. As prime p increases, theprobability of error decreases and may be practically close to 0.Accordingly, a larger prime p may be selected to yield more accuratehash codes, and a smaller prime p may be selected to yield less accuratehash codes. In certain embodiments, hash codes may be repeatedlygenerated to improve accuracy. Error decreases exponentially after eachrun. After k runs, error e≦(n/p)^(k).

If the first hash code equals the second hash code at step 426, themethod proceeds to step 430, where signature engine 36 designates thefirst Boolean function and the second Boolean function as equivalent. Ifthe first hash code does not equal the second hash code, the methodproceeds to step 434, where signature engine 36 designates the firstBoolean function and the second Boolean function as not equivalent.

Results are reported at step 438. The results may be reported usinginterface 20.

Examples of the method may be used in any suitable application. As anexample, hash codes may be used to verify communication of Booleanfunctions or BDDs over a communication link (wired and/or wireless) to anetwork node (such as a base station). A sending node may send Booleanfunctions and hash codes of the Boolean functions to a receiving nodeafter every K blocks of data of the Boolean functions. The hash codesmay be encrypted. Multiple hash codes may be sent or the same hash codemay be sent multiple times.

The receiving node may calculate a hash code for a Boolean function andcompare the calculated hash code with the hash code received with theBoolean function. If the hash codes are the same, the receiving node maydetermine that the Boolean function is valid, for example, has beenproperly received. Otherwise, the receiving node may determine that theBoolean function is not valid, for example, has not been properlyreceived and may have been corrupted.

As another example, hash codes may be used to mark and later validatedata stored as Boolean functions or BDDs. A hash code may be calculatedfor a Boolean function and may be stored with or separately from theBoolean function. At a later time, the Boolean function may be validatedusing the stored hash function. A new hash code may be calculated forthe Boolean function and compared with the stored hash code. If the hashcodes are the same, the Boolean function may be regarded as valid, suchas uncorrupted. Otherwise, the Boolean function may be regarded asinvalid, such as corrupted.

More accurate hash codes may be used to mark more important data, andless accurate hash codes may be used to mark less important data. Lessaccurate hash codes may be used if processing power is limited, such asfor storage in mobile phones.

Modifications, additions, or omissions may be made to the systems andapparatuses disclosed herein without departing from the scope of theinvention. The components of the systems and apparatuses may beintegrated or separated. Moreover, the operations of the systems andapparatuses may be performed by more, fewer, or other components. Forexample, the operations of BDD generator 30 and query engine 32 may beperformed by one component, or the operations of BDD generator 30 may beperformed by more than one component. Additionally, operations of thesystems and apparatuses may be performed using any suitable logiccomprising software, hardware, and/or other logic. As used in thisdocument, “each” refers to each member of a set or each member of asubset of a set.

Modifications, additions, or omissions may be made to the methodsdisclosed herein without departing from the scope of the invention. Themethods may include more, fewer, or other steps. Additionally, steps maybe performed in any suitable order.

A component of the systems and apparatuses disclosed herein may includean interface, logic, memory, and/or other suitable element. An interfacereceives input, sends output, processes the input and/or output, and/orperforms other suitable operation. An interface may comprise hardwareand/or software.

Logic performs the operations of the component, for example, executesinstructions to generate output from input. Logic may include hardware,software, and/or other logic. Logic may be encoded in one or moretangible media and may perform operations when executed by a computer.Certain logic, such as a processor, may manage the operation of acomponent. Examples of a processor include one or more computers, one ormore microprocessors, one or more applications, and/or other logic.

In particular embodiments, the operations of the embodiments may beperformed by one or more computer readable media encoded with a computerprogram, software, computer executable instructions, and/or instructionscapable of being executed by a computer. In particular embodiments, theoperations of the embodiments may be performed by one or more computerreadable media storing, embodied with, and/or encoded with a computerprogram and/or having a stored and/or an encoded computer program.

A memory stores information. A memory may comprise one or morenon-transitory, tangible, computer-readable, and/or computer-executablestorage media. Examples of memory include computer memory (for example,Random Access Memory (RAM) or Read Only Memory (ROM)), mass storagemedia (for example, a hard disk), removable storage media (for example,a Compact Disk (CD) or a Digital Video Disk (DVD)), database and/ornetwork storage (for example, a server), and/or other computer-readablemedium.

Components of the systems and apparatuses may be coupled by any suitablecommunication network. A communication network may comprise all or aportion of one or more of the following: a public switched telephonenetwork (PSTN), a public or private data network, a local area network(LAN), a metropolitan area network (MAN), a wide area network (WAN), alocal, regional, or global communication or computer network such as theInternet, a wireline or wireless network, an enterprise intranet, othersuitable communication link, or any combination of any of the preceding.

Although this disclosure has been described in terms of certainembodiments, alterations and permutations of the embodiments will beapparent to those skilled in the art. Accordingly, the above descriptionof the embodiments does not constrain this disclosure. Other changes,substitutions, and alterations are possible without departing from thespirit and scope of this disclosure, as defined by the following claims.

What is claimed is:
 1. A method by one or more processors associatedwith one or more computing devices, one or more of the processorsincluding a memory, the method comprising: generating, by one or more ofthe processors, a first characteristic function representing a first setof samples and a second characteristic function representing a secondset of samples; transforming, by one or more of the processors, thefirst characteristic function and the second characteristic function toa first arithmetic function and a second arithmetic function,respectively; calculating, by one or more of the processors, a firsthash code and a second hash code from the first arithmetic function andthe second arithmetic function, respectively; if the first hash codeequals the second hash code, designating, by one or more of theprocessors, the first set of samples and the second set of samples asequivalent; and otherwise, designating, by one or more of theprocessors, the first set of samples and the second set of samples asnot equivalent.
 2. The method of claim 1, the generating the firstcharacteristic function further comprising: representing each sample ofthe first set of samples as a minterm to yield a first set of minterms;and generating the first characteristic function from the first set ofminterms, the first characteristic function indicating whether a givenminterm is a member of the first set of minterms.
 3. The method of claim2, the representing each sample further comprising: receiving the eachsample, the each sample comprising one or more sensor values; and foreach sensor value, expressing the each sensor value as a binary numberusing one or more variables allocated to the each sensor value to yieldone or more binary numbers.
 4. The method of claim 2, the generating thefirst characteristic function from the first set of minterms furthercomprising: applying a logical OR operation to the minterms of the firstset of minterms to generate the first characteristic function.
 5. Themethod of claim 1, the calculating the first hash code furthercomprising: calculating a third hash code of a third characteristicfunction; calculating a fourth hash code of a fourth characteristicfunction, the first characteristic function comprising a Booleancombination of the third characteristic function and the fourthcharacteristic function; and calculating the first hash code from anarithmetic combination of the third hash code and the fourth hash code.6. The method of claim 1, each sample of the first sample set comprisinga tuple of one or more sensor values, each sensor value recording one ormore measurements taken by one or more sensors at a parameter value of aparameter.
 7. The method of claim 6, the parameter representing a time,a temperature, or a location.
 8. The method of claim 1, wherein thefirst and second characteristic functions are represented by a firstbinary decision diagram (BDD) and second BDD, respectively.
 9. Themethod of claim 8, wherein the first and second BDD each comprises oneor more rooted directed acyclic graphs representing the first and secondcharacteristic functions, respectively.
 10. The method of claim 9,wherein the first and second BDD each comprise a plurality of nodes anda plurality of edges connecting the nodes thereby forming a plurality ofpaths, each path in the BDD representing a set of variable assignmentssetting the represented characteristic function to either 1 or
 0. 11. Anapparatus comprising: a memory configured to store a first set ofsamples and a second set of samples; and one or more processorsconfigured to: generate a first characteristic function representing thefirst set of samples and a second characteristic function representingthe second set of samples; transform the first characteristic functionand the second characteristic function to a first arithmetic functionand a second arithmetic function, respectively; calculate a first hashcode and a second hash code from the first arithmetic function and thesecond arithmetic function, respectively; if the first hash code equalsthe second hash code, designate the first set of samples and the secondset of samples as equivalent; and otherwise, designate the first set ofsamples and the second set of samples as not equivalent.
 12. Theapparatus of claim 11, the generating the first characteristic functionfurther comprising: representing each sample of the first set of samplesas a minterm to yield a first set of minterms; and generating the firstcharacteristic function from the first set of minterms, the firstcharacteristic function indicating whether a given minterm is a memberof the first set of minterms.
 13. The apparatus of claim 12, therepresenting each sample further comprising: receiving the each sample,the each sample comprising one or more sensor values; and for eachsensor value, expressing the each sensor value as a binary number usingone or more variables allocated to the each sensor value to yield one ormore binary numbers.
 14. The apparatus of claim 12, the generating thefirst characteristic function from the first set of minterms furthercomprising: applying a logical OR operation to the minterms of the firstset of minterms to generate the first characteristic function.
 15. Theapparatus of claim 11, the calculating the first hash code furthercomprising: calculating a third hash code of a third characteristicfunction; calculating a fourth hash code of a fourth characteristicfunction, the first characteristic function comprising a Booleancombination of the third characteristic function and the fourthcharacteristic function; and calculating the first hash code from anarithmetic combination of the third hash code and the fourth hash code.16. The apparatus of claim 11, each sample of the first sample setcomprising a tuple of one or more sensor values, each sensor valuerecording one or more measurements taken by one or more sensors at aparameter value of a parameter.
 17. The apparatus of claim 16, theparameter representing a time, a temperature, or a location.
 18. One ormore non-transitory computer-readable media storing code, when executedby one or more processors, configured to: generate a firstcharacteristic function representing a first set of samples and a secondcharacteristic function representing a second set of samples; transformthe first characteristic function and the second characteristic functionto a first arithmetic function and a second arithmetic function,respectively; calculate a first hash code and a second hash code fromthe first arithmetic function and the second arithmetic function,respectively; if the first hash code equals the second hash code,designate the first set of samples and the second set of samples asequivalent; and otherwise, designate the first set of samples and thesecond set of samples as not equivalent.
 19. The media of claim 18, thegenerating the first characteristic function further comprising:representing each sample of the first set of samples as a minterm toyield a first set of minterms; and generating the first characteristicfunction from the first set of minterms, the first characteristicfunction indicating whether a given minterm is a member of the first setof minterms.
 20. The media of claim 19, the representing each samplefurther comprising: receiving the each sample, the each samplecomprising one or more sensor values; and for each sensor value,expressing the each sensor value as a binary number using one or morevariables allocated to the each sensor value to yield one or more binarynumbers.
 21. The media of claim 19, the generating the firstcharacteristic function from the first set of minterms furthercomprising: applying a logical OR operation to the minterms of the firstset of minterms to generate the first characteristic function.
 22. Themedia of claim 18, the calculating the first hash code furthercomprising: calculating a third hash code of a third characteristicfunction; calculating a fourth hash code of a fourth characteristicfunction, the first characteristic function comprising a Booleancombination of the third characteristic function and the fourthcharacteristic function; and calculating the first hash code from anarithmetic combination of the third hash code and the fourth hash code.23. The media of claim 18, each sample of the first sample setcomprising a tuple of one or more sensor values, each sensor valuerecording one or more measurements taken by one or more sensors at aparameter value of a parameter.
 24. The media of claim 23, the parameterrepresenting a time, a temperature, or a location.